Inner-imaging 3D attention module for residual network

نویسندگان

چکیده

Abstract We propose an Inner-Imaging three-dimensional (3D) attentional feature fusion module for a residual network, which is simple yet effective approach networks. In our attention module, we constructed 3D soft map to refine the input feature. The fuses features from different dimensions, including channel and spatial axes, create map. Then, implemented further fuse features. Lastly, outputs that applied branch. can also model relationship between dimensions achieve interaction This function allows acquire more To demonstrate effectiveness of method, extensive experiments were conducted on several computer vision benchmark datasets, ImageNet 2012 Microsoft COCO (MS COCO) 2017 datasets. experimental results show method performed better than baseline methods in tasks image classification, object detection, instance segmentation tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

3D imaging and modeling of the middle and inner ear

The bones of the middle ear are the smallest bones in the body and are among the most complicated functionally. They are located within the temporal bone – rendering them difficult to access and study. An accurate 3D model can offer an excellent illustration of the complex spatial relationships between the ossicles and the nerves and muscles with which they intertwine. The overall objective was...

متن کامل

RRA: Recurrent Residual Attention for Sequence Learning

In this paper, we propose a recurrent neural network (RNN) with residual attention (RRA) to learn long-range dependencies from sequential data. We propose to add residual connections across timesteps to RNN, which explicitly enhances the interaction between current state and hidden states that are several timesteps apart. This also allows training errors to be directly back-propagated through r...

متن کامل

Recurrent Residual Module for Fast Inference in Videos

Deep convolutional neural networks (CNNs) have made impressive progress in many video recognition tasks such as video pose estimation and video object detection. However, CNN inference on video is computationally expensive due to processing dense frames individually. In this work, we propose a framework called Recurrent Residual Module (RRM) to accelerate the CNN inference for video recognition...

متن کامل

Augmented laminography, a correlative 3D imaging method for revealing the inner structure of compressed fossils

Non-destructive imaging techniques can be extremely useful tools for the investigation and the assessment of palaeontological objects, as mechanical preparation of rare and valuable fossils is precluded in most cases. However, palaeontologists are often faced with the problem of choosing a method among a wide range of available techniques. In this case study, we employ x-ray computed tomography...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied Intelligence

سال: 2022

ISSN: ['0924-669X', '1573-7497']

DOI: https://doi.org/10.1007/s10489-022-03225-9